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(54) Dual storage controllers 

(57) A primary controller (1 04) operates to transmit 
write data and a write time to a secondary controller 
(109) in the earlier sequence of the write times after 
reporting a completion of a request for write to a 
processing unit (100). The secondary controller ( 09) 
stores the write data and the write time transmitted from 
the primary controller (104) in the cache memory (108). 
At a time, the secondary controller (109) stores the write 
data in a disk unit (105) in the earlier sequence of the 
write time. These operations make it possible to guaran- 
tee all the write data on or before the reference time. 
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[0001 1] The present invention relates to dual writinc of 
data to be executed through the effect of two coders 
In particular the present invention is effete m the 
case that a long d.stance exists between the two con 
troHers so that a delay takes place in transferring data 
between the controllers. 

[0002] The following techniques have been disclosed 
as the prior art of the present invention d,sc,osed 

ofi7?i«*i h Tht \ Eur °P ean P«ent Publication No. 
0671686A1 has d.sclosed a technique of doing dual 
writing through the effect of controllers spaced from 
each other by a long distance. In this technique, one 
conf er guarantees data on a disk therein if toe other 
controller ,s broken by a disaster such as an earth- 
quake. The technique disclosed in EP-0671686A1 is 
arranged so that a primary controller directly receives 
write data from a host computer, transfers the recced 
write data t0 a secondary contro|)er (ocated 

Satr to an a h r r rts comp,etion ° f receipt ° f tne s 

data to a host computer. This is a quite excellent 
method from a viewpoint of data securj becZse the 
data to be stored in the primary controllers con^Tetely 
equ^lent to the data to be stored in the secondly cot 
trailer However, a longer distance existing between two 
ST 'f * makes * da * *™fer time between the con 
troHers far larger This method therefore has difficulty on 
perfomiance to be solved if one controller is far awaj 
from the other controller. y 

No°?67 29 Sli E T P r! atent App,ication ^cation 
No. 0672985A1 also has disclosed a technique of doino 

from "2? nr SkS T lf aded in »» contro,lars »r "S 
SSJS 6r The teChnique disc,osed *n EP- 

S2f ,S arran96d 50 mat a P rimar V stroller 
directly recedes write data from a primary host compu- 
ter, immediately after receipt of the write data, raports 
complete of the receipt of the write data to the primary 
host computer. In the EP-0672985A1, a copy S Tfhe 
write data received by the primary controller isTead out 
to the primary host computer. According to the inven- 

I om I T '" S 9iV6n to the write data reived 

from the primary host computer for the first time The 
time stamp indicates a time when a request for writing 
he write data ,s issued. When the copy of the write ita 

Is" oafs^ 6 Primary ^ *ne write Se 

s passed to the pnmary host computer as well. Then 
the primary host computer sends the copy of the write 

[0005] When the secondary host computer receives 
the wn e data and the write time, information such 2 
" e data - s writte n on the disk for the control pur- 
Pose. Further, the write data is written on the disk on the 
secondary side in the sequence of the write time by 

rnnn*? 9 *! ! time 9iven t0 eacn writ e data record. 
[0006] In the EP-06729851A1. the secondary host 



computer performs the foregoing procedure, because it 
s not desirable to leave the intermediate resuftfof the 
fransactions normally used by an online system ^ 
example, considering a transaction of transferrino a 
5 bank deposit from a bank account A to anoter bank 

ba?k Tccounf deP ° Sit iS ^ 

resoo^Z, ^ " 6 St8te ° f havin9 trans,erred "o cor- 
responding deposit to the bank account B is not left 
This means that no intermediate result of the fransS 
w ton , is left. Normally, the recovery unit is a Vans^fon 

esutt of he transaction is a quite significant obstacle 
[0007] m turn, why no intermediate result of the trans- 

descnbed below. The disks on which data is dually writ 
ten contain a disk for storing a database such as 
account information and a disk for storing a journal 
where an update history of the transaction is | efl "he 

- ana ^eThf iL' S * ™™ is ^ed to 

analyze the journal so that the update result of the 

unfinished transaction is returned to the state before toe 

^TZ™^™""* 6 resu,t - the franSLn 
Wnt6 date Writ,en °n the disk loaded in the 
2s SnL C ? r ° ,ler " 6ffeCtive "* in *™ cases suS 
datat^ 

data is broken. The secondary controller does not store 

I a bUt Can 9Uarantee tne write ^ up 

IhT 6 - Hence ' apparently, the host computer 

so JSSf52SS nf . state to the ,ai,ed ** a * • *£ 

Jm ' S auaran teed. As a result, by usino 
the disk for storing the journal, loaded in the second^ 
controller and thedisk for storing the datoble toe S ^ 
Jar process to the recovery to be execute? when ™e 

Snri p 1 6 ^ ansact, °n may not be left. 

II i C hrml the *! COntr °" er COntains a non-vol- 
atile cache memory and provides a write-after tech 
n-que. toat is, a technique of writing SJSLSt 
receded from the host computer onto a no^olatite 

to the host computer. The non-volatile cache memorv fe 
h^hly re,,ab,e, so that the data may be sufficiently £*" 
n S ^ stor,n 9 the write data in the cache. 
[0009] The technique disclosed in the EP-0672985A1 
I ar T 9ed 10 SUPPreSS the de 9 rade ° f the p2S22 

conS.i° me ^ iS ' OSt 6VSn if 8 stance betwe^S 
controllers is expanded. Further, it does not leave toe 
intermediate result of the transaction 



so SUMMARY OF THE INVENTION 

oS^Ai OWSVer ' * he teChnique disclosed ^ ^e EP- 
0672985A1 ,s arranged so that the primary host compu- 
ter reads out data and transfers the data to the second 
* aryhost computer. Hence, unlike the EmSSSK 
arranged to directly transfer the write data be^een the 
controllers this technique needs one more daTa traS 
fers. Further, an I/O process to and from a storage 
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medium such as an MT is required to be done. 
m01 1 ] It is an object of the present invention to pro- 
vide a technique which provides functions of directly 
transferring write data between controllers like the EP- 
0672985A1, suppressing degrade of the performance 
to a minimum even if the distance between the control- 
lers is made longer, and leaving no intermedia e result 
of a transaction. Further, no execution of the . I/O proc- 
ess of control information to and from disks is required ^ 
for enhancing the performance. 
10012] Later, the description will be oriented to how 
the present invention realizes the aforementioned 

100131 According to the present invention, the host 
computer gives a write time to each write data record is 
when it issues a request for write to a primary controller. 
When the primary controller receives the write data 
from the host computer, it reports the completion of the 
receipt to the host computer. Then, the primary control- 
ler sends the write data records and the write times to so 
the secondary controller. At this time, the primary con- 
troller operates to send the write data records to the 
secondary controller in the sequence of the writing 
times. These series of operations make it possible to 
suppress the degrade of the performance to a minimum 25 
even though the distance between the controllers is 
made longer. 

[001 41 The secondary controller operates to store the 
write data received from the primary controller onto a 
non-volatile cache memory. This makes it possible to 30 
guarantee the write data without any I/O process of con- 
trol information to and from disks. 
F001 51 The secondary controller can guarantee the 
write data up to a certain time by referring to the 
received write time. This makes it possible to leave no 35 
intermediate result of the transaction. 



cuted when a master secondary controller collects 
information about the write time from the primary 
controller. 

DESCRIPTION OF THE PREFERRED EMBODI- 
MENTS 

[0017] Hereafter, the description will be oriented to 
embodiments of the present invention. 



BRIEF DESCRIPTION OF THE DRAWINGS 
[0016] 



Fig 1 is a diagram showing a general system con- 
figuration according to a first embodiment of the 
present invention; 

Fig. 2 is a view showing a format of write data man- « 
aging information; 

Fig 3 is an explanatory view showing a logical d.sK 
Fig 4 is an explanatory view showing a process of 
a secondary write data destaging unit in the case of 
executing the transfer of the write data from the pn- so 
mary controller to the secondary controller; 
Fig 5 is a diagram showing a general system con- 
figuration according to a second embodiment of the 
present invention; 

Fig 6 is a diagram showing a general system con- ss 
figuration according to a third embodiment of the 
present invention; and 

Fig. 7 is a diagram showing a process to be exe- 



(A) First Embodiment 

[0018] At first, the first embodiment will be described 

below. ,. „ , 

[0019] Fig. 1 illustrates a general configuration of the 
first embodiment. The system of the first embodiment is 
configured to have one or more processing units 100, 
one primary controller 104. one or more disk units 105 
connected to the primary controller 104, one secondary 
controller 109, and one or more disk units 105 con- 
nected to the secondary controller 109. The processing 
unit 100 is composed of a CPU 101, a main storage 
102 and a channel 103. The primary controller 104 
contains a control memory 107 and a cache memory 
108 The control memory 107 and the cache memory 
108 are non-volatilized. For enhancing the reliability, 
each memory may be dualized. The cache memory 108 
or the control memory 107 is composed of a semicon- 
ductor memory. As compared with the disk unit 105. the 
memory 107 or 108 provides a faster accessing capabil- 
ity by one or two digits. The primary controller 104 oper- 
ates to transfer data between the processing unit 100 
and the disk unit 105. Further, according to the present 
invention, the primary controller 104 provides a function 
of transferring data with the secondary controller 109. 
Or the primary controller 104 contains one or more 
directors 106, each of which may operate to transfer 
data between the processing unit 200 and the disk unit 
205 and between the secondary controller 109 and the 
40 director 1 06 itself. The internal arrangement of the sec- 
ondary controller 109 is likewise to that of the primary 
controller 104. 

[0020] The write data managing information 1 13 cor- 
responding to the write data record 112 is created on 
the control memory 1 07. 

[0021 ] When the processing unit 1 00 issues a request 
for write 1 10 to the primary controller 104. the process- 
ing unit 1 00 operates to give a write time 1 1 1 to the write 
data record 112. The write time 111 represents a time 
when the request for write 110 is issued. The primary 
controller 104 and the secondary controller 109 can rec- 
ognize the sequence of the requests for write 110 
issued by the processing unit 100 by referring to this 
write time 1 1 1 . If two or more processing units 1 00 are 
provided, the write time 111 is defined by using the 
common clocks among the processing units 100, so 
that the primary controller 104 and the secondary con- 
troller 108 can recognize the sequence of the requests 
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m«T I ° ,SSuedb y di «^nt processing units 110. 

f 1 • 9 , 2 , '" UStrates a imposition of write data 
managing .nformation 113. Herein, in particulj the 
description wil, be oriented to the information d^ecfly 
concerned with the present invention. In the presert 5 

f Sn"' 3 d ' Sk t0 be w "en the proSSfno 

unit 100 issues the request for write 110isrefe °Stoas 
a log,cal disk. A logical disk ID 120 indicates a T,mh!r 

that In 9 ' 03 ' indiCat6d * the P^S S 
that the corresponding write data is written on the logi w 
cal disk and ,s contained in the request for wr te 110 
This invention does not require the one-to-oTcor£ 
spondence between the logical disk recognized b? he 
processing unit 100 and the disk unit 105 (phyS 
disk). As ...ustrated in Fig. 3 , the logical disk may S 15 

Z?"? *° ° r m ° re disk units ""OS. Further the 
logical disk may contain redundant data and w 
emp,oy a RAID (Redundant Array oftex^e d£S 
arrangement. A numeral 121 denotes Vwrite a^rtS 
that ,s «nforma, lon (for examp|e informatjon 

areaof1Mbytestartingfromaheadofthe.iTca.dlk) 
for ,nd,cat,ng an address inside of a logical dik where 

tained ,n the request for write 110. A numeral ?22 
denotes a wnte data lengtt, that represents a length 2S 
the corresponding write data and is contained in the 
request for write 110. These pieces of inSSion art 
all contained in the normal request for write 110 A 
numeral 123 denotes a write data pointer that is a 

dSL fK The ^ wrrte tom e 111 has been already 
described above. The write time 1 1 1 to be given to the 
request for write 110 is a feature of the Con a 

STTf w ♦ den ° teS 3 neCessit y bit that indicates tha^ 
me wnte data is required to be transferred to the sec 35 
ondary controller. (Herein, the bit 124 is refenid to as a 

trSr ty ^ If V ^ neCeSSit y bit 124 * '"formation 

?2 lZV! nte T Sfer 0i me "^ponding write date 
1 12 to the secondary controller 1 09 

I^ 3] , An0ther it6m ° f inform ati°n contained in the 40 

fS Wy 1 °7 iS 3 Secondar y "WW disknumber 

lcx£Ja sk ith ,S ' 0Cated l ° COrres P° nd to the 
og.cal disk of the pnmary controller 104 and contains 

numbers of the secondary logical disk of the ^e 

spondmg logical disk, which are paired for dual writing. 45 

hat is. a number of the secondary controller 109 fo 

loading he secondary logical disk and a logical disk 

number located in the secondary controller lO^of t he 

does not have any pair for dual writing, a null value for so 
indicating invalidity is put into the logical disk 
[0024] The control memory 107 of the secondary con- 

1 13 as well. The information may have the same format 
as the write data managing information 1 1 3 Tth e ph 55 
mary controller 104. in which the necessity bit iL 
constantly off. The control memory 107 of the second 
ary controller 109 contains a primal 



EP 0 902 370 A2 



6 



ZSLh * P 6S f nt in,orn *«on is located to corre- 
spond to the logical disk of the secondary controller! mm 
and contams the numbers of the primary STdisk nf 
the corresponding logical disk, whfoh are S fc d J2 

for S that ^ 1,16 number of the p*5SS£12 

for loading the primary logical disk and the looical di^ 

iTS n 0 Tr mar r ntro,,er l04of "^5SSf 

.cal disk. Of course, if the logical disk does not contain 
any pair for dual writing, a null value for indicSno Va- 
lidity is put into the logical disk 
[0025] The primary controller 1 04 provides a nrimarv 
write da ,a receiving unit 130. whichTste^ when * 
receives the request for write 110 from 
unit 100 At«rc* rom the Process ng 

unit 100^ At first, the received write date 1 12 is stored in 
^e cache memory 108 (step 131). Next the £Ej 

ZTJ^ reCe,V ! n9 unit 130 ^^ates to secure 2 
wnte data managing information 1 13 inside of the Jlf 
trol memory 108 for Droee^inr, «, TOn " 
fsten .o» p„hI P roces s<ng this request for write 

« w ■ the informati °n such as the write 
time 111 contained in the request for write is stored 

selT. 7? ^ mana9i "9 '"foroiation t ° 3 ,^ 
Jrt. ,- y ' ^completion of the requestor 

al^f? PrOC6SSin9 makes « Possible 00? 

SSk^XT DeCaUSe "° access is re d^ed for the 

I t Th6 Pr ° CesS of writi "9 the Sta data 1 12 
onto the disk unrt 105 is executed h„ • 

tro,,er104ata,atersta^ 

02TTh?nT° ,,er - ^ itis ^deecr?bXSr 
[0026] The primary controller 104 has a primary write 
data transmitting unit 140. which provides a SioTof 

ier 109. At first, the write data record 1 12 at the Mriiact 

S e i T- in K c,uded in the wfte data «2S5SS 

ton 1 13 in which the necessity bit 124 is set fe tmnSS 
ted to the secondary controller loT^VSTS 
dual writing focated therein by referring tc^ ^the'corrt 

£ro? t H ea:,ndary ' 09iCa ' disk nu "*er 114 The 
tength of the wnte data 112 and the address inside of 
the secondary logical disk where the write date is ? fhe 

™n™iry ? by referrin9 to ■^ss n^ 6 

^Next IhT ^ man99in9 informa « on 1 13 (step 
14 ). Next, the primary write data transmitting unit 140 
waits for a report on the completion from the second*™ 
controHer 109 (step 142). When the report on thT^m" 

K;'5: , 2 the r essify St 

^iep i4d;. I hen, the operation aoes hark ^* 
^hthenextwritedatato 9 ^^^^^^^^^^^^^ 

S a Jll ! eCOndary COntro,,er 109 "es a secondary 

write data 1 1 2 T r 9 r rt w 60 / WhiCh iS Started w "en the 
wme date 112 ,s received from the primary controller 
104. The processing content of the secondary write 

write datar ^ 160 iS ,ik6wise to *at of fiC 
write data receiving unit 140 exront +h=+ *^ ^- y 

<a.n a process of 5etBng necesslt) , ™£ 
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161) After the write data 11 2 from the primary controller 
104 is written in the cache 108, the completion of the 
write is reported to the primary controller 104. 
[0028] The secondary controller 1 09 has a secondary 
write data destaging unit 1 70, which provides a function 
of writing the write data 1 1 2 to the disk unit 1 05. At first, 
some write data records 112 included in the write data 
managing information 1 13 are determined to be written 
in the disk unit 105 in the sequence from the earliest 
write time. Then, a necessary calculation is executed to 
determine the disk unit 105 where the write data 
records are to be written and the write address. The 
method of calculation is not described in detail, because 
it is used for the normal RAID (step 171). Next, two or 
more requests for writing the write data 1 12 to the disk 
unit 105 are issued to the disk unit 105 in parallel (step 

1 72) Next, the secondary write data destaging unit 1 70 
waits for a report on the completion of the request (step 

173) . After the reports on the completion of all the 
requests are received, the operation goes back to the 
step 171 at which the next write data 113 to be 
destaged to the next disk unit 105 is tried to be found. 
[0029] The transmitting sequence of the write data 
113 from the primary controller 104 to the secondary 
controller 109 is the sequence of the write time 111. 
Hence, the secondary controller 109 enables to gener- 
ate a state that it can hold all the write data records 1 1 3 
on or before a reference time and cannot hold all the 
write data records 113 after the reference time. This 
operation makes it possible for the secondary controller 
109 to do a recovering process without leaving any 
intermediate result of a transaction if the primary con- 
troller 104 is broken. Further, on the secondary control- 
ler 109, the control information such as the write data 
1 1 2 and the write time 1 11 is held in a non-volatile sem- 
iconductor memory such as the cache memory 108 and 
the control memory 107. Hence, no large overhead is 
burdened on the performance. 

[0030] The aforementioned process may not provide 
sufficient performance because the transfer of the write 
data 1 1 2 from the primary controller 1 04 to the second- 
ary controller 109 is serialized. Fig. 4 illustrates the 
process executed in the case of transferring the write 
data 1 12 from the primary controller 104 to the second- 
ary controller 109 in parallel. In each processing unit, a 
primary write data transmitting unit a300, a primary ref- 
erence time transmitting unit 170, a secondary refer- 
ence time receiving unit 180, a secondary write data 
destaging unit a310, and a primary fail-time data scrap- 
ping unit 190 are changed from the configuration in 
which the transfer is serialized. 

[0031] Hereafter, the processing flow of the primary 
write data transmitting unit a300 will be described. At 
first some write data records 112 included in the write 
data managing information 113 in which the necessity 
bit 124 is set are transferred in parallel to the secondary 
controller 109 having a pair for dual writing located 
therein by referring to the corresponding secondary log- 



ical disk number in the sequence from the earliest write 
time (step 301). Next, the primary write data transmit- 
ting unit a300 waits for a report on the completion of 
each write from the secondary controller 1 09 (step 302) . 
5 After all the reports are given back, the necessity bit 124 
included in the corresponding write data managing 
information 1 13 is set off (step 303). Then, the operation 
goes back to the step 150 at which the next write data 
1 12 to be transmitted is tried to be found. 
w [0032] The execution of the parallel transfer of the 
write data records 1 1 2 may make each write time 1 1 1 of 
the write data records 112 be out of the sequence. 
Hence, it is necessary to recognize the reference write 
time 111 for determining the write data 112 to be 
15 destaged by the secondary controller 109. In this case, 
the write data 1 12 to be destaged is made to have the 
earlier write time 111 than the reference time corre- 
sponding to the earliest write time 1 1 1 among the write 
data included in the write data managing information 
20 113 where the necessity bit 124 is set on, which are ail 
contained in the primary controller 1 04. This is because 
all the write data 112 having the earlier write time 1 1 1 
than the reference time are held on the side of the sec- 
ondary controller 109. On the other hand, the write data 
25 records 1 12 having the write time 1 1 1 on or before the 
reference time should not be destaged. If the primary 
controller 1 04 is broken, the write data records 1 1 2 have 
to be broken without being destaged. 
[0033] The primary reference time transmitting unit 
30 170 provides a function of transmitting the destageable 
reference time to the secondary controller 1 09. As men- 
tioned above, the reference time is the earliest write 
time 111 included in the write data managing informa- 
tion 113 where the necessity bit 1 24 is set on. 
35 [0034] The secondary reference time receiving unit 
180 operates to store the reference time received from 
the primary controller 104 in the control memory 108 as 
a destageable time 185. 

[0035] Fig. 4 illustrates a processing flow of the sec- 
40 ondary write data destaging unit a310 executed in the 
case of transmitting the write data records 112 from the 
primary controller 1 04 to the secondary controller 1 09 in 
parallel. The different respect of this flow from the 
processing flow shown in Fig. 1 is that the condition for 
45 selecting the write data 1 12 to be destaged includes a 
condition for determining if the write time 111 is on or 
before the destageable time 185 (step 311). 
[0036] The primary fail-time data scrapping unit 197 
provides a function of scrapping the write data 112 hav- 
so ing the write time 111 later than the destageable time 
185 when the primary controller 104 is broken (step 
191). 



55 



(B) Second Embodiment 

[0037] Next, the second embodiment will be described 
below. 

[0038] Fig. 5 illustrates a system configuration accord- 



5 



9 



EP 0 902 370 A2 



10 



ing to a second embodiment of the present invention. 
The difference between thefirst.and the second embod- 
iments is the number of the primary controllers 1 04. The 
system of the first embodiment is configured to have 
one primary controller 104 and one secondary control- 
ler 109, while the system of the second embodiment is 
configured to have two or more primary controllers 104 
and one secondary controller 209. 
[0039] The provision of two or more primary control- 
lers 1 04 makes the write time 1 1 1 of the write data 1 12 
received from one primary controller 104 shifted from 
that received from another primary controller 1 04 on the 
side of the secondary controller 109. The latest write 
time 111 of the write data 112 received from one pri- 
mary controller 104 (for example, the primary controller 
a) is assumed as a time a, while the latest write time 

1 1 1 of the write data 1 12 received from another primary 
controller 104 (for example, the primary controller b) is 
assumed as a time b. In this assumption, if the time a is 
earlier than the time b, the primary controller b may hold 
the write data 1 13 of the time later than the time a but 
earlier than the time b. As mentioned above, in order to 
erase the intermediate result of a transaction, it is nec- 
essary to guarantee all the write data records 113 of the 
write time 1 12 on or before a reference time and discard 
all the write data records 1 13 of the later write times 1 12 
than the reference time. Hence, the write data record 

1 12 of the write time 1 1 1 on or before the time a is made 
to be the write data record 112 to be destaged on the 
secondary controller 109. 

[0040] In correspondence with the above, the control 
memory 108 of the secondary controller 109 stores a 
write enable time 500 to the primary controller. The 
write enable time 500 is the information existing in each 
primary controller 104 and is the latest write time 1 1 1 
received from the corresponding primary controller 104. 
As mentioned above, therefore, of these write enable 
times 500, the write data record 112 of the write time 

111 on or before the reference time, that is, the earliest 
time corresponds to the write data record 112 to be 
destaged onto the secondary controller 1 09. 
[0041] Later, about the present embodiment, the 
description will be oriented to the content of each 
processing unit in the case of transferring the write data 

112 from one primary controller 104 to the secondary 
controller 109 in parallel. It goes without saying that the 
present embodiment is effective in the case of serializ- 
ing the transfer of the write data 1 12 from one primary 
controller 104 to the secondary controller 109. 
[0042] The processing flow of each processing unit 
included in the primary controller 104 is likewise to the 
processing flow (the process shown in Fig. 3) executed 
in the case of transferring the write data 1 12 in parallel 
in the first embodiment. 

[0043] In turn, the description will be oriented to the 
processing flow of each processing unit included in the 
secondary controller 1 09. 

[0044] The description will be oriented to the process- 



ing flow of a secondary write data destaging unit b510. 
Herein, the description will be oriented to the difference 
between the processing flow of the secondary write 
data destaging unit 520 included in the second embodi- 
s ment and the processing flow of the secondary write 
data destaging unit 170 included in the first embodi- 
ment. The content of the process executed in the sec- 
ondary write data receiving unit 510 according to the 
second embodiment is executed to check if the corre- 
w sponding write time 1 1 1 is on or before ail the write ena- 
ble times 500 and select the write data record 1 12 for 
meeting the condition (step 511). Except this process, 
the processing flow of the secondary write data destag- 
ing unit b51 0 included in the second embodiment is like- 
is wise to the processing flow of the secondary write data 
destaging unit 170 included in the first embodiment. 
[0045] The secondary reference time receiving unit 
b520 operates to set the reference time received from 
the primary controller 104 as a write enable time 500 of 
so the primary controller 104 having transmitted the refer- 
ence time. 

[0046] In this embodiment, the write data 1 12 to be 
scrapped by the primary fail-time data scrapping unit 
b530 corresponds to the write data 1 12 that does not 
25 meet the condition that the corresponding write time 
1 1 1 is on or before the write enable times 500 of all the 
primary controllers (step 531). 

(C) Third Embodiment 

30 

[0047] In turn, the description will be oriented to the 
third embodiment of the invention. 
[0048] Fig. 6 illustrates a general system configuration 
according to the third embodiment. The difference 
35 between the second and the third embodiments is also 
the numbers of the primary controllers 104 and the sec- 
ondary controllers 109. The system of the third embod- 
iment is configured to have two or more primary 
controllers 104 and one or more secondary controllers 
40 1 09. In this case, each pair of the primary controller 1 04 
and the secondary controller 109 is not required to be 
connected. 

[0049] If two or more secondary controllers 109 are 
provided, in order to erase the intermediate result of a 
45 transaction, it is necessary to keep the reference times 
used for selecting the write data records 112 to be 
destaged common among the secondary controllers 
1 09. This is because the database and the journal may 
be distributed among the secondary controllers 109. 
so [0050] According to this embodiment, a master sec- 
ondary controller 700 provides a function of determining 
the reference time used for selecting the write data 
record 1 1 2 to be destaged. Hence, a data transfer path 
is connected between the master controller 700 and the 
55 other secondary controllers 109. If the data transfer 
path is failed, it is impossible to keep the reference 
times used for selecting the write data records 1 12 to be 
destaged common among the secondary controllers 
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1 09. Hence, multiplexing the data transfer path is prefer- 
able. In this embodiment, the master secondary control- 
ler 700 provides a function of determining the reference 
time used for selecting the write data record 1 12 to be 
destaged. Without providing the specific secondary 5 
controller 1 09 with the function of determining the refer- 
ence time, the method for distributing the function 
among the secondary controllers 109 (for example, the 
method wherein the secondary controllers 109 alter- 
nately determine the reference time) may be employed w 
for implementing the present invention. 
[0051] In correspondence with the above, the control 
memory 109 of the master secondary controller 700 
stores a write time 701 of the secondary controller. This 
write time 701 is the information corresponding to the is 
secondary controllers 109 containing the master sec- 
ondary controller 700. The write time 701 of each sec- 
ondary controller corresponds to the information 
periodically received by the master secondary controller 
700 from each secondary controller 109 at the earliest 20 
time (the reference time on which the secondary con- 
troller 109 selects the secondary controller 109 in the 
second embodiment) among all the primary controller 
write enable times 500 contained in the secondary con- 
troller 109. 25 
[0052] In the third embodiment, the master write time 
702 is a reference time on which each secondary con- 
troller 109 selects the write data. The master write time 
702 is set by the process wherein the master secondary 
controller 700 refers to all the secondary controller write sc 
times 701 at a proper period and selects the earliest 
time. All the write data records 112 having the write 
times 1 1 1 on or before the selected time are contained 
in the secondary controller 109. Hence, by guarantee- 
ing the write data 112 that meet this condition and 3i 
scrapping all the write data 112 that do not meet it, it is 
possible to erase the intermediate result of the transac- 
tion. 

[0053] Hereafter, also in this embodiment, the 
description will be oriented to the content of each 4. 
processing unit in the case of transferring the write data 
112 from one primary controller 104 to the secondary 
controllers 109 in parallel. It goes without saying that 
this embodiment is effective if the transfer of the write 
data 1 1 2 from one primary controller 1 04 to the second- 4 
ary controllers 109 is serialized. 

[0054] The processing flow of each processing unit 
included in the primary controller 104 is substantially 
similar to that of the second embodiment. It goes with- 
out saying that the primary reference time transmitting £ 
unit 170 provides a function of transmitting a reference 
time to each secondary controller 109 for transmitting 
the write data 1 1 2. The reference time to be transmitted 
corresponds to the earliest write time 111 contained in 
the write data managing information 113 in which the 1 
necessity bit 124 of each write data record 112 corre- 
sponding to the secondary controller 109 for transmit- 
ting the reference time is set on. 



[0055] The different respect of the processing flow of 
the third embodiment from that of the second embodi- 
ment is that the secondary controller 109 contains a 
secondary write time transmitting unit 71 0, a secondary 
write data destaging unit c720, and a primary fail-time 
data scrapping point c. and the master secondary con- 
troller 700 contains a master secondary write time 
receiving unit 711, a master write time calculating unit 
712, and a master secondary write time transmitting 
unit 71 3. 

[0056] The secondary write time transmitting unit 710 
operates to transmit the earliest time 180 included in all 
the primary controller write enable times 500 stored in 
the primary controller 109 to the master secondary write 
time receiving unit 711 included in the master second- 
ary controller 700 at a proper period. The secondary 
write time transmitting unit 710 included in the second- 
ary controller 109 except the master secondary control- 
ler 700 makes use of a data transfer path between the 
secondary controllers 109. The secondary write time 
transmitting unit 710 of the master secondary controller 
700 makes use of communicating means provided in 
the master secondary controller 700. 
[0057] The master secondary write time receiving unit 
71 1 operates to set the time received from the second- 
ary write time transmitting unit 710 to the secondary 
controller write. time 701 corresponding to the second- 
ary controller 109 having transmitted the time. 
[0058] The master write time calculating unit 712 
operates to refer to all the secondary controller write 
times 701 , select the earliest time, and then set the time 
as the master write time 702. 

[0059] The master secondary write time transmitting 
unit 71 3 operates to transmit the time set as the master 
write time 702 at a proper period in response to the 
requests issued from the secondary write data destag- 
ing unit 720 and the primary fail-time data scrapping unit 
730 included in each secondary controller 109. The 
secondary controller 109 rather than the master sec- 
ondary controller 700 makes use of the data transfer 
path between the secondary controllers 109. For the 
request issued from the master secondary controller 
700, the master secondary write time transmitting unit 
713 makes use of communicating means provided in 
the master secondary controller 700. 
[0060] The different respect of the secondary write 
data destaging unit 720 from that of the second embod- 
iment-is that when selecting the write data 112 to be 
destaged, the secondary write data destaging unit 720 
operates to receive the reference time from the master 
secondary write time transmitting unit 713 and select 
the write data 112 with the earlier write time 111 than 
the reference time as the write data to be destaged 
(step 721). 

; [0061] The different respect of the primary fail-time 
data scrapping unit c730 from that of the second 
embodiment is that when selecting the write data 1 1 2 to 
be scrapped from the cache memory 107, the primary 
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fail-time data scrapping unit c730 operates to receive 
the reference timet rom the master secondary write time 
transmitting unit 713 and select the write data 112 
except all the write data 112 with the earlier write time 
111 than the reference time as the write data to be 5 
scrapped (step 731). 

[0062] According to this embodiment, the master con- 
troller 700 is arranged to receive the information 
required for calculating the reference time from the sec- 
ondary controller 109. In place, it may be arranged to 10 
receive the information from the primary controller 104 
as shown in Fig. 7. In this case, the control memory 108 
of the master secondary controller 700 stores a master 
primary controller write time 800. The write time 800 
corresponds to the primary controller 104. The write 75 
time 800 is set as follows. The master secondary con- 
troller 700 receives the earliest write time 1 1 1 included 
in all the write data managing information 113 contain- 
ing the necessity bits 124 being set at a proper period 
and sets it as the write time 800 inside of the primary so 
controller 1 09. For the master write time 702, the master 
secondary controller 700 operates to refer to all the pri- 
mary controller write times 701, select the earliest time 
and set it as the master write time 702 at a proper 
period. Like the foregoing embodiments, the reference 25 
time for the destaging or the data scrapping is the mas- 
ter write time 702. 

Claims 

30 

1 . A computer system comprising: 

a processing unit (100); 
a first controller (104) connected to said 
processing unit and having a first cache mem- 35 
ory (108); 

a first disk unit (105) connected to said first 
controller; 

a second controller (109) connected to said 
first controller and having a second cache 40 
memory (108); 

a second disk (105) connected to said second 

controller; and 

wherein 

said processing unit issues a request for write 45 
containing write data and a write time to said 
first controller, 

said first controller (a) stores said write data in 
said first cache memory, (b) reports completion 
of said request for write to said processing unit, so 
(c) stores said write data stored in said first 
cache memory in said first disk unit, and (d) 
transmits said write data and said write time 
stored in said first cache memory to said sec- 
ond controller in the sequence of said write 55 
time, wherein said steps (c) and (d) are exe- 
cuted in parallel, and 

said second controller (e) stores said write data 



and write time transmitted from said first con- 
troller to said second cache memory and (f) 
stores said write data stored in said second 
cache memory to said second disk unit in the 
sequence of said write time. 

2. The computer system according to claim 1 , wherein 
after said step (e) is executed, said second control- 
ler transmits a report on completion to said first 
controller. 

3. The computer system according to claim 1 , wherein 
said first and second cache memories are non-vol- 
atile. 

4. The computer system according to claim 1 , further 
comprising: 

another processing unit; and 

wherein both of said processing units have 

common clocks to each other. 

5. The computer system, comprising: 

a processing unit (100); 
a first controller (104) connected to said 
processing unit and having a first cache mem- 
ory (108); 

a first disk unit (105) connected to said first 
controller; 

a second controller (109) connected to said 
first controller and having a second cache 
memory (108); 

a second disk unit (1 05) connected to said sec- 
ond controller; and 
wherein 

said processing unit issues a request for write 
containing write data and a write time to said 
first controller, 

said first controller (a) stores said write data in 
said first cache memory, (b) reports completion 
of said request for write to said processing unit, 
(c) stores said write data stored in said first 
cache memory to said first disk unit, (d) trans- 
mits plural pairs of said write data and said 
write time and a destageable time stored in 
said first cache memory to said second control- 
ler in the earlier sequence of said write times, 
wherein said steps (c) and (d) are executed in 
parallel, and 

said second controller (e) stores said write data 
and said write time transmitted from said first 
controller in said second cache memory and (f) 
stores in said second disk unit said write data 
corresponding to said earlier write time than 
said destageable time and stored in said sec- 
ond cache memory. 
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6. The computer system according to claim 5, wherein 
after said step (e) is executed, said second control- 
ler transmits a report on the completion to said first 
controller. 

5 

7. The computer system according to claim 5, wherein 
said first and second cache memories are non-vol- 
atile. 

8. The computer system according to claim 5, further io 
comprising: 

another processing unit; and 

wherein both of said processing units have 

common clocks. 15 
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